Overview
Brought to you by YData
Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 426880 |
| Missing cells | 1215152 |
| Missing cells (%) | 15.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 58.6 MiB |
| Average record size in memory | 144.0 B |
Variable types
| Numeric | 4 |
|---|---|
| Text | 4 |
| Categorical | 10 |
drive is highly overall correlated with type | High correlation |
odometer is highly overall correlated with year | High correlation |
type is highly overall correlated with drive | High correlation |
year is highly overall correlated with odometer | High correlation |
fuel is highly imbalanced (62.7%) | Imbalance |
title_status is highly imbalanced (89.9%) | Imbalance |
manufacturer has 17646 (4.1%) missing values | Missing |
model has 5277 (1.2%) missing values | Missing |
condition has 174104 (40.8%) missing values | Missing |
cylinders has 177678 (41.6%) missing values | Missing |
odometer has 4400 (1.0%) missing values | Missing |
title_status has 8242 (1.9%) missing values | Missing |
VIN has 161042 (37.7%) missing values | Missing |
drive has 130567 (30.6%) missing values | Missing |
size has 306361 (71.8%) missing values | Missing |
type has 92858 (21.8%) missing values | Missing |
paint_color has 130203 (30.5%) missing values | Missing |
price is highly skewed (γ1 = 254.4069323) | Skewed |
odometer is highly skewed (γ1 = 38.04001486) | Skewed |
id has unique values | Unique |
price has 32895 (7.7%) zeros | Zeros |
Reproduction
| Analysis started | 2025-04-20 03:53:05.951485 |
|---|---|
| Analysis finished | 2025-04-20 03:53:18.424344 |
| Duration | 12.47 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
id
Real number (ℝ)
Unique 
| Distinct | 426880 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.3114866 × 109 |
| Minimum | 7.2074081 × 109 |
|---|---|
| Maximum | 7.3171011 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 7.2074081 × 109 |
|---|---|
| 5-th percentile | 7.3031501 × 109 |
| Q1 | 7.3081433 × 109 |
| median | 7.3126208 × 109 |
| Q3 | 7.3152535 × 109 |
| 95-th percentile | 7.3167433 × 109 |
| Maximum | 7.3171011 × 109 |
| Range | 1.0969296 × 108 |
| Interquartile range (IQR) | 7110204.2 |
Descriptive statistics
| Standard deviation | 4473170.4 |
|---|---|
| Coefficient of variation (CV) | 0.0006118004 |
| Kurtosis | 17.057761 |
| Mean | 7.3114866 × 109 |
| Median Absolute Deviation (MAD) | 3096588 |
| Skewness | -1.4301233 |
| Sum | 3.1211274 × 1015 |
| Variance | 2.0009254 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7222695916 | 1 | < 0.1% |
| 7313139418 | 1 | < 0.1% |
| 7313423023 | 1 | < 0.1% |
| 7313423324 | 1 | < 0.1% |
| 7313424533 | 1 | < 0.1% |
| 7313425823 | 1 | < 0.1% |
| 7313426990 | 1 | < 0.1% |
| 7313427132 | 1 | < 0.1% |
| 7313426423 | 1 | < 0.1% |
| 7313426503 | 1 | < 0.1% |
| Other values (426870) | 426870 |
| Value | Count | Frequency (%) |
| 7207408119 | 1 | |
| 7208549803 | 1 | |
| 7209027818 | 1 | |
| 7209054699 | 1 | |
| 7209064557 | 1 | |
| 7210384030 | 1 | |
| 7212512589 | 1 | |
| 7212631321 | 1 | |
| 7213839225 | 1 | |
| 7213843538 | 1 |
| Value | Count | Frequency (%) |
| 7317101084 | 1 | |
| 7317098990 | 1 | |
| 7317098055 | 1 | |
| 7317096748 | 1 | |
| 7317096685 | 1 | |
| 7317096571 | 1 | |
| 7317096373 | 1 | |
| 7317096141 | 1 | |
| 7317096101 | 1 | |
| 7317096069 | 1 |
region
Text
| Distinct | 404 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
Length
| Max length | 26 |
|---|---|
| Median length | 20 |
| Mean length | 11.44423 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | prescott |
|---|---|
| 2nd row | fayetteville |
| 3rd row | florida keys |
| 4th row | worcester / central MA |
| 5th row | greensboro |
| Value | Count | Frequency (%) |
| 64305 | 8.6% | |
| city | 12302 | 1.6% |
| new | 9171 | 1.2% |
| bay | 8365 | 1.1% |
| st | 7915 | 1.1% |
| san | 7639 | 1.0% |
| south | 7598 | 1.0% |
| county | 6893 | 0.9% |
| jersey | 6781 | 0.9% |
| fort | 6553 | 0.9% |
| Other values (491) | 610100 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 476629 | 9.8% |
| e | 409823 | 8.4% |
| o | 365776 | 7.5% |
| n | 348420 | 7.1% |
| 320742 | 6.6% | |
| s | 315838 | 6.5% |
| l | 303305 | 6.2% |
| t | 284497 | 5.8% |
| r | 283439 | 5.8% |
| i | 276202 | 5.7% |
| Other values (45) | 1500642 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4885313 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 476629 | 9.8% |
| e | 409823 | 8.4% |
| o | 365776 | 7.5% |
| n | 348420 | 7.1% |
| 320742 | 6.6% | |
| s | 315838 | 6.5% |
| l | 303305 | 6.2% |
| t | 284497 | 5.8% |
| r | 283439 | 5.8% |
| i | 276202 | 5.7% |
| Other values (45) | 1500642 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4885313 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 476629 | 9.8% |
| e | 409823 | 8.4% |
| o | 365776 | 7.5% |
| n | 348420 | 7.1% |
| 320742 | 6.6% | |
| s | 315838 | 6.5% |
| l | 303305 | 6.2% |
| t | 284497 | 5.8% |
| r | 283439 | 5.8% |
| i | 276202 | 5.7% |
| Other values (45) | 1500642 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4885313 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 476629 | 9.8% |
| e | 409823 | 8.4% |
| o | 365776 | 7.5% |
| n | 348420 | 7.1% |
| 320742 | 6.6% | |
| s | 315838 | 6.5% |
| l | 303305 | 6.2% |
| t | 284497 | 5.8% |
| r | 283439 | 5.8% |
| i | 276202 | 5.7% |
| Other values (45) | 1500642 |
price
Real number (ℝ)
Skewed  Zeros 
| Distinct | 15655 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75199.033 |
| Minimum | 0 |
|---|---|
| Maximum | 3.7369287 × 109 |
| Zeros | 32895 |
| Zeros (%) | 7.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5900 |
| median | 13950 |
| Q3 | 26485.75 |
| 95-th percentile | 44500 |
| Maximum | 3.7369287 × 109 |
| Range | 3.7369287 × 109 |
| Interquartile range (IQR) | 20585.75 |
Descriptive statistics
| Standard deviation | 12182282 |
|---|---|
| Coefficient of variation (CV) | 162.00052 |
| Kurtosis | 69205.089 |
| Mean | 75199.033 |
| Median Absolute Deviation (MAD) | 9450 |
| Skewness | 254.40693 |
| Sum | 3.2100963 × 1010 |
| Variance | 1.48408 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 32895 | 7.7% |
| 6995 | 3169 | 0.7% |
| 7995 | 3129 | 0.7% |
| 9995 | 2867 | 0.7% |
| 8995 | 2837 | 0.7% |
| 4500 | 2778 | 0.7% |
| 5995 | 2727 | 0.6% |
| 3500 | 2716 | 0.6% |
| 29990 | 2705 | 0.6% |
| 6500 | 2594 | 0.6% |
| Other values (15645) | 368463 |
| Value | Count | Frequency (%) |
| 0 | 32895 | |
| 1 | 1951 | 0.5% |
| 2 | 13 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 16 | < 0.1% |
| 6 | 12 | < 0.1% |
| 7 | 8 | < 0.1% |
| 8 | 7 | < 0.1% |
| 9 | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 3736928711 | 2 | < 0.1% |
| 3024942282 | 2 | < 0.1% |
| 3009548743 | 1 | < 0.1% |
| 1410065407 | 1 | < 0.1% |
| 1234567890 | 1 | < 0.1% |
| 1111111111 | 2 | < 0.1% |
| 987654321 | 2 | < 0.1% |
| 135008900 | 1 | < 0.1% |
| 123456789 | 6 | |
| 113456789 | 1 | < 0.1% |
year
Real number (ℝ)
High correlation 
| Distinct | 114 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1205 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2011.2352 |
| Minimum | 1900 |
|---|---|
| Maximum | 2022 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1998 |
| Q1 | 2008 |
| median | 2013 |
| Q3 | 2017 |
| 95-th percentile | 2020 |
| Maximum | 2022 |
| Range | 122 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 9.4521196 |
|---|---|
| Coefficient of variation (CV) | 0.004699659 |
| Kurtosis | 19.579889 |
| Mean | 2011.2352 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -3.5779204 |
| Sum | 8.5613254 × 108 |
| Variance | 89.342565 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2017 | 36420 | 8.5% |
| 2018 | 36369 | 8.5% |
| 2015 | 31538 | 7.4% |
| 2013 | 30794 | 7.2% |
| 2016 | 30434 | 7.1% |
| 2014 | 30283 | 7.1% |
| 2019 | 25375 | 5.9% |
| 2012 | 23898 | 5.6% |
| 2011 | 20341 | 4.8% |
| 2020 | 19298 | 4.5% |
| Other values (104) | 140925 |
| Value | Count | Frequency (%) |
| 1900 | 12 | |
| 1901 | 3 | < 0.1% |
| 1902 | 1 | < 0.1% |
| 1903 | 12 | |
| 1905 | 1 | < 0.1% |
| 1909 | 1 | < 0.1% |
| 1910 | 2 | < 0.1% |
| 1913 | 2 | < 0.1% |
| 1915 | 1 | < 0.1% |
| 1916 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 2022 | 133 | < 0.1% |
| 2021 | 2396 | 0.6% |
| 2020 | 19298 | |
| 2019 | 25375 | |
| 2018 | 36369 | |
| 2017 | 36420 | |
| 2016 | 30434 | |
| 2015 | 31538 | |
| 2014 | 30283 | |
| 2013 | 30794 |
manufacturer
Categorical
Missing 
| Distinct | 42 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17646 |
| Missing (%) | 4.1% |
| Memory size | 3.3 MiB |
| ford | |
|---|---|
| chevrolet | |
| toyota | |
| honda | |
| nissan | 19067 |
| Other values (37) |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 5.7946578 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | gmc |
|---|---|
| 2nd row | chevrolet |
| 3rd row | chevrolet |
| 4th row | toyota |
| 5th row | ford |
Common Values
| Value | Count | Frequency (%) |
| ford | 70985 | |
| chevrolet | 55064 | |
| toyota | 34202 | 8.0% |
| honda | 21269 | 5.0% |
| nissan | 19067 | 4.5% |
| jeep | 19014 | 4.5% |
| ram | 18342 | 4.3% |
| gmc | 16785 | 3.9% |
| bmw | 14699 | 3.4% |
| dodge | 13707 | 3.2% |
| Other values (32) | 126100 | |
| (Missing) | 17646 | 4.1% |
Length
| Value | Count | Frequency (%) |
| ford | 70985 | |
| chevrolet | 55064 | |
| toyota | 34202 | 8.4% |
| honda | 21269 | 5.2% |
| nissan | 19067 | 4.7% |
| jeep | 19014 | 4.6% |
| ram | 18342 | 4.5% |
| gmc | 16785 | 4.1% |
| bmw | 14699 | 3.6% |
| dodge | 13707 | 3.3% |
| Other values (32) | 126121 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 257522 | 10.9% |
| e | 239422 | 10.1% |
| r | 196161 | 8.3% |
| a | 186064 | 7.8% |
| d | 162166 | 6.8% |
| t | 136711 | 5.8% |
| c | 124158 | 5.2% |
| n | 114989 | 4.8% |
| l | 106299 | 4.5% |
| i | 99297 | 4.2% |
| Other values (17) | 748582 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2371371 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 257522 | 10.9% |
| e | 239422 | 10.1% |
| r | 196161 | 8.3% |
| a | 186064 | 7.8% |
| d | 162166 | 6.8% |
| t | 136711 | 5.8% |
| c | 124158 | 5.2% |
| n | 114989 | 4.8% |
| l | 106299 | 4.5% |
| i | 99297 | 4.2% |
| Other values (17) | 748582 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2371371 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 257522 | 10.9% |
| e | 239422 | 10.1% |
| r | 196161 | 8.3% |
| a | 186064 | 7.8% |
| d | 162166 | 6.8% |
| t | 136711 | 5.8% |
| c | 124158 | 5.2% |
| n | 114989 | 4.8% |
| l | 106299 | 4.5% |
| i | 99297 | 4.2% |
| Other values (17) | 748582 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2371371 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 257522 | 10.9% |
| e | 239422 | 10.1% |
| r | 196161 | 8.3% |
| a | 186064 | 7.8% |
| d | 162166 | 6.8% |
| t | 136711 | 5.8% |
| c | 124158 | 5.2% |
| n | 114989 | 4.8% |
| l | 106299 | 4.5% |
| i | 99297 | 4.2% |
| Other values (17) | 748582 |
model
Text
Missing 
| Distinct | 29649 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 5277 |
| Missing (%) | 1.2% |
| Memory size | 3.3 MiB |
Length
| Max length | 203 |
|---|---|
| Median length | 177 |
| Mean length | 11.91973 |
| Min length | 1 |
Unique
| Unique | 15290 ? |
|---|---|
| Unique (%) | 3.6% |
Sample
| 1st row | sierra 1500 crew cab slt |
|---|---|
| 2nd row | silverado 1500 |
| 3rd row | silverado 1500 crew |
| 4th row | tundra double cab sr |
| 5th row | f-150 xlt |
| Value | Count | Frequency (%) |
| 1500 | 24082 | 2.6% |
| sport | 23261 | 2.6% |
| 4d | 18645 | 2.1% |
| silverado | 17396 | 1.9% |
| sedan | 15508 | 1.7% |
| cab | 15224 | 1.7% |
| f-150 | 10417 | 1.1% |
| 4x4 | 9664 | 1.1% |
| grand | 8913 | 1.0% |
| sierra | 8703 | 1.0% |
| Other values (8692) | 757366 |
Most occurring characters
| Value | Count | Frequency (%) |
| 487722 | 9.7% | |
| e | 395746 | 7.9% |
| a | 371389 | 7.4% |
| r | 366159 | 7.3% |
| s | 278320 | 5.5% |
| t | 259161 | 5.2% |
| i | 237544 | 4.7% |
| o | 228379 | 4.5% |
| l | 212188 | 4.2% |
| c | 206496 | 4.1% |
| Other values (107) | 1982290 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5025394 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 487722 | 9.7% | |
| e | 395746 | 7.9% |
| a | 371389 | 7.4% |
| r | 366159 | 7.3% |
| s | 278320 | 5.5% |
| t | 259161 | 5.2% |
| i | 237544 | 4.7% |
| o | 228379 | 4.5% |
| l | 212188 | 4.2% |
| c | 206496 | 4.1% |
| Other values (107) | 1982290 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5025394 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 487722 | 9.7% | |
| e | 395746 | 7.9% |
| a | 371389 | 7.4% |
| r | 366159 | 7.3% |
| s | 278320 | 5.5% |
| t | 259161 | 5.2% |
| i | 237544 | 4.7% |
| o | 228379 | 4.5% |
| l | 212188 | 4.2% |
| c | 206496 | 4.1% |
| Other values (107) | 1982290 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5025394 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 487722 | 9.7% | |
| e | 395746 | 7.9% |
| a | 371389 | 7.4% |
| r | 366159 | 7.3% |
| s | 278320 | 5.5% |
| t | 259161 | 5.2% |
| i | 237544 | 4.7% |
| o | 228379 | 4.5% |
| l | 212188 | 4.2% |
| c | 206496 | 4.1% |
| Other values (107) | 1982290 |
condition
Categorical
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 174104 |
| Missing (%) | 40.8% |
| Memory size | 3.3 MiB |
| good | |
|---|---|
| excellent | |
| like new | |
| fair | 6769 |
| new | 1305 |
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 6.3441506 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | good |
|---|---|
| 2nd row | good |
| 3rd row | good |
| 4th row | good |
| 5th row | excellent |
Common Values
| Value | Count | Frequency (%) |
| good | 121456 | |
| excellent | 101467 | |
| like new | 21178 | 5.0% |
| fair | 6769 | 1.6% |
| new | 1305 | 0.3% |
| salvage | 601 | 0.1% |
| (Missing) | 174104 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| good | 121456 | |
| excellent | 101467 | |
| new | 22483 | 8.2% |
| like | 21178 | 7.7% |
| fair | 6769 | 2.5% |
| salvage | 601 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 348663 | |
| o | 242912 | |
| l | 224713 | |
| n | 123950 | 7.7% |
| g | 122057 | 7.6% |
| d | 121456 | 7.6% |
| x | 101467 | 6.3% |
| c | 101467 | 6.3% |
| t | 101467 | 6.3% |
| i | 27947 | 1.7% |
| Other values (8) | 87550 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1603649 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 348663 | |
| o | 242912 | |
| l | 224713 | |
| n | 123950 | 7.7% |
| g | 122057 | 7.6% |
| d | 121456 | 7.6% |
| x | 101467 | 6.3% |
| c | 101467 | 6.3% |
| t | 101467 | 6.3% |
| i | 27947 | 1.7% |
| Other values (8) | 87550 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1603649 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 348663 | |
| o | 242912 | |
| l | 224713 | |
| n | 123950 | 7.7% |
| g | 122057 | 7.6% |
| d | 121456 | 7.6% |
| x | 101467 | 6.3% |
| c | 101467 | 6.3% |
| t | 101467 | 6.3% |
| i | 27947 | 1.7% |
| Other values (8) | 87550 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1603649 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 348663 | |
| o | 242912 | |
| l | 224713 | |
| n | 123950 | 7.7% |
| g | 122057 | 7.6% |
| d | 121456 | 7.6% |
| x | 101467 | 6.3% |
| c | 101467 | 6.3% |
| t | 101467 | 6.3% |
| i | 27947 | 1.7% |
| Other values (8) | 87550 | 5.5% |
cylinders
Categorical
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 177678 |
| Missing (%) | 41.6% |
| Memory size | 3.3 MiB |
| 6 cylinders | |
|---|---|
| 4 cylinders | |
| 8 cylinders | |
| 5 cylinders | 1712 |
| 10 cylinders | 1455 |
| Other values (3) | 2162 |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 10.975426 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 8 cylinders |
|---|---|
| 2nd row | 8 cylinders |
| 3rd row | 8 cylinders |
| 4th row | 8 cylinders |
| 5th row | 6 cylinders |
Common Values
| Value | Count | Frequency (%) |
| 6 cylinders | 94169 | |
| 4 cylinders | 77642 | |
| 8 cylinders | 72062 | |
| 5 cylinders | 1712 | 0.4% |
| 10 cylinders | 1455 | 0.3% |
| other | 1298 | 0.3% |
| 3 cylinders | 655 | 0.2% |
| 12 cylinders | 209 | < 0.1% |
| (Missing) | 177678 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cylinders | 247904 | |
| 6 | 94169 | 18.9% |
| 4 | 77642 | 15.6% |
| 8 | 72062 | 14.5% |
| 5 | 1712 | 0.3% |
| 10 | 1455 | 0.3% |
| other | 1298 | 0.3% |
| 3 | 655 | 0.1% |
| 12 | 209 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 249202 | |
| r | 249202 | |
| s | 247904 | |
| 247904 | ||
| c | 247904 | |
| y | 247904 | |
| l | 247904 | |
| i | 247904 | |
| n | 247904 | |
| d | 247904 | |
| Other values (11) | 253462 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2735098 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 249202 | |
| r | 249202 | |
| s | 247904 | |
| 247904 | ||
| c | 247904 | |
| y | 247904 | |
| l | 247904 | |
| i | 247904 | |
| n | 247904 | |
| d | 247904 | |
| Other values (11) | 253462 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2735098 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 249202 | |
| r | 249202 | |
| s | 247904 | |
| 247904 | ||
| c | 247904 | |
| y | 247904 | |
| l | 247904 | |
| i | 247904 | |
| n | 247904 | |
| d | 247904 | |
| Other values (11) | 253462 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2735098 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 249202 | |
| r | 249202 | |
| s | 247904 | |
| 247904 | ||
| c | 247904 | |
| y | 247904 | |
| l | 247904 | |
| i | 247904 | |
| n | 247904 | |
| d | 247904 | |
| Other values (11) | 253462 |
fuel
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3013 |
| Missing (%) | 0.7% |
| Memory size | 3.3 MiB |
| gas | |
|---|---|
| other | 30728 |
| diesel | 30062 |
| hybrid | 5170 |
| electric | 1698 |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.41438 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | gas |
|---|---|
| 2nd row | gas |
| 3rd row | gas |
| 4th row | gas |
| 5th row | gas |
Common Values
| Value | Count | Frequency (%) |
| gas | 356209 | |
| other | 30728 | 7.2% |
| diesel | 30062 | 7.0% |
| hybrid | 5170 | 1.2% |
| electric | 1698 | 0.4% |
| (Missing) | 3013 | 0.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| gas | 356209 | |
| other | 30728 | 7.2% |
| diesel | 30062 | 7.1% |
| hybrid | 5170 | 1.2% |
| electric | 1698 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 386271 | |
| g | 356209 | |
| a | 356209 | |
| e | 94248 | 6.5% |
| r | 37596 | 2.6% |
| i | 36930 | 2.6% |
| h | 35898 | 2.5% |
| d | 35232 | 2.4% |
| t | 32426 | 2.2% |
| l | 31760 | 2.2% |
| Other values (4) | 44464 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1447243 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| s | 386271 | |
| g | 356209 | |
| a | 356209 | |
| e | 94248 | 6.5% |
| r | 37596 | 2.6% |
| i | 36930 | 2.6% |
| h | 35898 | 2.5% |
| d | 35232 | 2.4% |
| t | 32426 | 2.2% |
| l | 31760 | 2.2% |
| Other values (4) | 44464 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1447243 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| s | 386271 | |
| g | 356209 | |
| a | 356209 | |
| e | 94248 | 6.5% |
| r | 37596 | 2.6% |
| i | 36930 | 2.6% |
| h | 35898 | 2.5% |
| d | 35232 | 2.4% |
| t | 32426 | 2.2% |
| l | 31760 | 2.2% |
| Other values (4) | 44464 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1447243 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| s | 386271 | |
| g | 356209 | |
| a | 356209 | |
| e | 94248 | 6.5% |
| r | 37596 | 2.6% |
| i | 36930 | 2.6% |
| h | 35898 | 2.5% |
| d | 35232 | 2.4% |
| t | 32426 | 2.2% |
| l | 31760 | 2.2% |
| Other values (4) | 44464 | 3.1% |
odometer
Real number (ℝ)
High correlation  Missing  Skewed 
| Distinct | 104870 |
|---|---|
| Distinct (%) | 24.8% |
| Missing | 4400 |
| Missing (%) | 1.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98043.331 |
| Minimum | 0 |
|---|---|
| Maximum | 10000000 |
| Zeros | 1965 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6318 |
| Q1 | 37704 |
| median | 85548 |
| Q3 | 133542.5 |
| 95-th percentile | 204000 |
| Maximum | 10000000 |
| Range | 10000000 |
| Interquartile range (IQR) | 95838.5 |
Descriptive statistics
| Standard deviation | 213881.5 |
|---|---|
| Coefficient of variation (CV) | 2.1814997 |
| Kurtosis | 1690.7574 |
| Mean | 98043.331 |
| Median Absolute Deviation (MAD) | 47910.5 |
| Skewness | 38.040015 |
| Sum | 4.1421347 × 1010 |
| Variance | 4.5745296 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100000 | 2263 | 0.5% |
| 1 | 2246 | 0.5% |
| 0 | 1965 | 0.5% |
| 200000 | 1728 | 0.4% |
| 150000 | 1603 | 0.4% |
| 160000 | 1250 | 0.3% |
| 140000 | 1244 | 0.3% |
| 130000 | 1204 | 0.3% |
| 120000 | 1199 | 0.3% |
| 180000 | 1062 | 0.2% |
| Other values (104860) | 406716 | |
| (Missing) | 4400 | 1.0% |
| Value | Count | Frequency (%) |
| 0 | 1965 | |
| 1 | 2246 | |
| 2 | 153 | < 0.1% |
| 3 | 58 | < 0.1% |
| 4 | 138 | < 0.1% |
| 5 | 193 | < 0.1% |
| 6 | 33 | < 0.1% |
| 7 | 69 | < 0.1% |
| 8 | 37 | < 0.1% |
| 9 | 38 | < 0.1% |
| Value | Count | Frequency (%) |
| 10000000 | 50 | |
| 9999999 | 88 | |
| 9876543 | 1 | < 0.1% |
| 9750924 | 1 | < 0.1% |
| 9099999 | 1 | < 0.1% |
| 9000000 | 3 | < 0.1% |
| 8888888 | 4 | < 0.1% |
| 8765548 | 1 | < 0.1% |
| 8675309 | 1 | < 0.1% |
| 8393929 | 1 | < 0.1% |
title_status
Categorical
Imbalance  Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8242 |
| Missing (%) | 1.9% |
| Memory size | 3.3 MiB |
| clean | |
|---|---|
| rebuilt | 7219 |
| salvage | 3868 |
| lien | 1422 |
| missing | 814 |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 5.0558239 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | clean |
|---|---|
| 2nd row | clean |
| 3rd row | clean |
| 4th row | clean |
| 5th row | clean |
Common Values
| Value | Count | Frequency (%) |
| clean | 405117 | |
| rebuilt | 7219 | 1.7% |
| salvage | 3868 | 0.9% |
| lien | 1422 | 0.3% |
| missing | 814 | 0.2% |
| parts only | 198 | < 0.1% |
| (Missing) | 8242 | 1.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| clean | 405117 | |
| rebuilt | 7219 | 1.7% |
| salvage | 3868 | 0.9% |
| lien | 1422 | 0.3% |
| missing | 814 | 0.2% |
| parts | 198 | < 0.1% |
| only | 198 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 417824 | |
| e | 417626 | |
| a | 413051 | |
| n | 407551 | |
| c | 405117 | |
| i | 10269 | 0.5% |
| t | 7417 | 0.4% |
| r | 7417 | 0.4% |
| u | 7219 | 0.3% |
| b | 7219 | 0.3% |
| Other values (8) | 15850 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2116560 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 417824 | |
| e | 417626 | |
| a | 413051 | |
| n | 407551 | |
| c | 405117 | |
| i | 10269 | 0.5% |
| t | 7417 | 0.4% |
| r | 7417 | 0.4% |
| u | 7219 | 0.3% |
| b | 7219 | 0.3% |
| Other values (8) | 15850 | 0.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2116560 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 417824 | |
| e | 417626 | |
| a | 413051 | |
| n | 407551 | |
| c | 405117 | |
| i | 10269 | 0.5% |
| t | 7417 | 0.4% |
| r | 7417 | 0.4% |
| u | 7219 | 0.3% |
| b | 7219 | 0.3% |
| Other values (8) | 15850 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2116560 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 417824 | |
| e | 417626 | |
| a | 413051 | |
| n | 407551 | |
| c | 405117 | |
| i | 10269 | 0.5% |
| t | 7417 | 0.4% |
| r | 7417 | 0.4% |
| u | 7219 | 0.3% |
| b | 7219 | 0.3% |
| Other values (8) | 15850 | 0.7% |
transmission
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2556 |
| Missing (%) | 0.6% |
| Memory size | 3.3 MiB |
| automatic | |
|---|---|
| other | |
| manual | 25118 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.2315259 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | other |
|---|---|
| 2nd row | other |
| 3rd row | other |
| 4th row | other |
| 5th row | automatic |
Common Values
| Value | Count | Frequency (%) |
| automatic | 336524 | |
| other | 62682 | 14.7% |
| manual | 25118 | 5.9% |
| (Missing) | 2556 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| automatic | 336524 | |
| other | 62682 | 14.8% |
| manual | 25118 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 735730 | |
| a | 723284 | |
| o | 399206 | |
| u | 361642 | |
| m | 361642 | |
| i | 336524 | |
| c | 336524 | |
| h | 62682 | 1.8% |
| e | 62682 | 1.8% |
| r | 62682 | 1.8% |
| Other values (2) | 50236 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3492834 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 735730 | |
| a | 723284 | |
| o | 399206 | |
| u | 361642 | |
| m | 361642 | |
| i | 336524 | |
| c | 336524 | |
| h | 62682 | 1.8% |
| e | 62682 | 1.8% |
| r | 62682 | 1.8% |
| Other values (2) | 50236 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3492834 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 735730 | |
| a | 723284 | |
| o | 399206 | |
| u | 361642 | |
| m | 361642 | |
| i | 336524 | |
| c | 336524 | |
| h | 62682 | 1.8% |
| e | 62682 | 1.8% |
| r | 62682 | 1.8% |
| Other values (2) | 50236 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3492834 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 735730 | |
| a | 723284 | |
| o | 399206 | |
| u | 361642 | |
| m | 361642 | |
| i | 336524 | |
| c | 336524 | |
| h | 62682 | 1.8% |
| e | 62682 | 1.8% |
| r | 62682 | 1.8% |
| Other values (2) | 50236 | 1.4% |
VIN
Text
Missing 
| Distinct | 118246 |
|---|---|
| Distinct (%) | 44.5% |
| Missing | 161042 |
| Missing (%) | 37.7% |
| Memory size | 3.3 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 17 |
| Mean length | 16.958757 |
| Min length | 1 |
Unique
| Unique | 77966 ? |
|---|---|
| Unique (%) | 29.3% |
Sample
| 1st row | 3GTP1VEC4EG551563 |
|---|---|
| 2nd row | 1GCSCSE06AZ123805 |
| 3rd row | 3GCPWCED5LG130317 |
| 4th row | 5TFRM5F17HX120972 |
| 5th row | 1GT220CG8CZ231238 |
| Value | Count | Frequency (%) |
| 1fmju1jt1hea52352 | 261 | 0.1% |
| 3c6jr6dt3kg560649 | 235 | 0.1% |
| 1fter1eh1lla36301 | 231 | 0.1% |
| 5tftx4cn3ex042751 | 227 | 0.1% |
| 1gchtce37g1186784 | 214 | 0.1% |
| 1gtn1teh5ez273019 | 207 | 0.1% |
| 3vwf17at1fm655022 | 199 | 0.1% |
| jn1az4eh8km420880 | 198 | 0.1% |
| 1ftmf1cp3gkd62143 | 195 | 0.1% |
| 1gtr1we07dz143724 | 194 | 0.1% |
| Other values (118236) | 263677 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 425101 | 9.4% |
| 2 | 279570 | 6.2% |
| 3 | 273707 | 6.1% |
| 5 | 273040 | 6.1% |
| 4 | 241769 | 5.4% |
| 0 | 241757 | 5.4% |
| 6 | 223745 | 5.0% |
| 7 | 209685 | 4.7% |
| 8 | 198070 | 4.4% |
| 9 | 172062 | 3.8% |
| Other values (28) | 1969776 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4508282 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 425101 | 9.4% |
| 2 | 279570 | 6.2% |
| 3 | 273707 | 6.1% |
| 5 | 273040 | 6.1% |
| 4 | 241769 | 5.4% |
| 0 | 241757 | 5.4% |
| 6 | 223745 | 5.0% |
| 7 | 209685 | 4.7% |
| 8 | 198070 | 4.4% |
| 9 | 172062 | 3.8% |
| Other values (28) | 1969776 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4508282 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 425101 | 9.4% |
| 2 | 279570 | 6.2% |
| 3 | 273707 | 6.1% |
| 5 | 273040 | 6.1% |
| 4 | 241769 | 5.4% |
| 0 | 241757 | 5.4% |
| 6 | 223745 | 5.0% |
| 7 | 209685 | 4.7% |
| 8 | 198070 | 4.4% |
| 9 | 172062 | 3.8% |
| Other values (28) | 1969776 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4508282 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 425101 | 9.4% |
| 2 | 279570 | 6.2% |
| 3 | 273707 | 6.1% |
| 5 | 273040 | 6.1% |
| 4 | 241769 | 5.4% |
| 0 | 241757 | 5.4% |
| 6 | 223745 | 5.0% |
| 7 | 209685 | 4.7% |
| 8 | 198070 | 4.4% |
| 9 | 172062 | 3.8% |
| Other values (28) | 1969776 |
drive
Categorical
High correlation  Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 130567 |
| Missing (%) | 30.6% |
| Memory size | 3.3 MiB |
| 4wd | |
|---|---|
| fwd | |
| rwd |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | rwd |
|---|---|
| 2nd row | 4wd |
| 3rd row | 4wd |
| 4th row | 4wd |
| 5th row | 4wd |
Common Values
| Value | Count | Frequency (%) |
| 4wd | 131904 | |
| fwd | 105517 | |
| rwd | 58892 | |
| (Missing) | 130567 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 4wd | 131904 | |
| fwd | 105517 | |
| rwd | 58892 |
Most occurring characters
| Value | Count | Frequency (%) |
| w | 296313 | |
| d | 296313 | |
| 4 | 131904 | |
| f | 105517 | 11.9% |
| r | 58892 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 888939 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| w | 296313 | |
| d | 296313 | |
| 4 | 131904 | |
| f | 105517 | 11.9% |
| r | 58892 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 888939 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| w | 296313 | |
| d | 296313 | |
| 4 | 131904 | |
| f | 105517 | 11.9% |
| r | 58892 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 888939 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| w | 296313 | |
| d | 296313 | |
| 4 | 131904 | |
| f | 105517 | 11.9% |
| r | 58892 | 6.6% |
size
Categorical
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 306361 |
| Missing (%) | 71.8% |
| Memory size | 3.3 MiB |
| full-size | |
|---|---|
| mid-size | |
| compact | |
| sub-compact | 3194 |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 8.4452659 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | full-size |
|---|---|
| 2nd row | full-size |
| 3rd row | full-size |
| 4th row | full-size |
| 5th row | full-size |
Common Values
| Value | Count | Frequency (%) |
| full-size | 63465 | 14.9% |
| mid-size | 34476 | 8.1% |
| compact | 19384 | 4.5% |
| sub-compact | 3194 | 0.7% |
| (Missing) | 306361 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| full-size | 63465 | |
| mid-size | 34476 | |
| compact | 19384 | 16.1% |
| sub-compact | 3194 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 132417 | |
| l | 126930 | |
| - | 101135 | |
| s | 101135 | |
| z | 97941 | |
| e | 97941 | |
| u | 66659 | |
| f | 63465 | |
| m | 57054 | |
| c | 45156 | 4.4% |
| Other values (6) | 127982 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1017815 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 132417 | |
| l | 126930 | |
| - | 101135 | |
| s | 101135 | |
| z | 97941 | |
| e | 97941 | |
| u | 66659 | |
| f | 63465 | |
| m | 57054 | |
| c | 45156 | 4.4% |
| Other values (6) | 127982 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1017815 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 132417 | |
| l | 126930 | |
| - | 101135 | |
| s | 101135 | |
| z | 97941 | |
| e | 97941 | |
| u | 66659 | |
| f | 63465 | |
| m | 57054 | |
| c | 45156 | 4.4% |
| Other values (6) | 127982 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1017815 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 132417 | |
| l | 126930 | |
| - | 101135 | |
| s | 101135 | |
| z | 97941 | |
| e | 97941 | |
| u | 66659 | |
| f | 63465 | |
| m | 57054 | |
| c | 45156 | 4.4% |
| Other values (6) | 127982 |
type
Categorical
High correlation  Missing 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 92858 |
| Missing (%) | 21.8% |
| Memory size | 3.3 MiB |
| sedan | |
|---|---|
| SUV | |
| pickup | |
| truck | |
| other | |
| Other values (8) |
Length
| Max length | 11 |
|---|---|
| Median length | 5 |
| Mean length | 4.9978534 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | pickup |
|---|---|
| 2nd row | pickup |
| 3rd row | pickup |
| 4th row | pickup |
| 5th row | truck |
Common Values
| Value | Count | Frequency (%) |
| sedan | 87056 | |
| SUV | 77284 | |
| pickup | 43510 | |
| truck | 35279 | 8.3% |
| other | 22110 | 5.2% |
| coupe | 19204 | 4.5% |
| hatchback | 16598 | 3.9% |
| wagon | 10751 | 2.5% |
| van | 8548 | 2.0% |
| convertible | 7731 | 1.8% |
| Other values (3) | 5951 | 1.4% |
| (Missing) | 92858 |
Length
| Value | Count | Frequency (%) |
| sedan | 87056 | |
| suv | 77284 | |
| pickup | 43510 | |
| truck | 35279 | |
| other | 22110 | 6.6% |
| coupe | 19204 | 5.7% |
| hatchback | 16598 | 5.0% |
| wagon | 10751 | 3.2% |
| van | 8548 | 2.6% |
| convertible | 7731 | 2.3% |
| Other values (3) | 5951 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 144985 | 8.7% |
| e | 143832 | 8.6% |
| c | 138920 | 8.3% |
| n | 123736 | 7.4% |
| p | 106224 | 6.4% |
| u | 98510 | 5.9% |
| k | 95387 | 5.7% |
| d | 87665 | 5.3% |
| s | 87573 | 5.2% |
| t | 81718 | 4.9% |
| Other values (15) | 560843 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1669393 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 144985 | 8.7% |
| e | 143832 | 8.6% |
| c | 138920 | 8.3% |
| n | 123736 | 7.4% |
| p | 106224 | 6.4% |
| u | 98510 | 5.9% |
| k | 95387 | 5.7% |
| d | 87665 | 5.3% |
| s | 87573 | 5.2% |
| t | 81718 | 4.9% |
| Other values (15) | 560843 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1669393 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 144985 | 8.7% |
| e | 143832 | 8.6% |
| c | 138920 | 8.3% |
| n | 123736 | 7.4% |
| p | 106224 | 6.4% |
| u | 98510 | 5.9% |
| k | 95387 | 5.7% |
| d | 87665 | 5.3% |
| s | 87573 | 5.2% |
| t | 81718 | 4.9% |
| Other values (15) | 560843 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1669393 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 144985 | 8.7% |
| e | 143832 | 8.6% |
| c | 138920 | 8.3% |
| n | 123736 | 7.4% |
| p | 106224 | 6.4% |
| u | 98510 | 5.9% |
| k | 95387 | 5.7% |
| d | 87665 | 5.3% |
| s | 87573 | 5.2% |
| t | 81718 | 4.9% |
| Other values (15) | 560843 |
paint_color
Categorical
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 130203 |
| Missing (%) | 30.5% |
| Memory size | 3.3 MiB |
| white | |
|---|---|
| black | |
| silver | |
| blue | |
| red | |
| Other values (7) |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.7906747 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | white |
|---|---|
| 2nd row | blue |
| 3rd row | red |
| 4th row | red |
| 5th row | black |
Common Values
| Value | Count | Frequency (%) |
| white | 79285 | |
| black | 62861 | |
| silver | 42970 | 10.1% |
| blue | 31223 | 7.3% |
| red | 30473 | 7.1% |
| grey | 24416 | 5.7% |
| green | 7343 | 1.7% |
| custom | 6700 | 1.6% |
| brown | 6593 | 1.5% |
| yellow | 2142 | 0.5% |
| Other values (2) | 2671 | 0.6% |
| (Missing) | 130203 |
Length
| Value | Count | Frequency (%) |
| white | 79285 | |
| black | 62861 | |
| silver | 42970 | |
| blue | 31223 | 10.5% |
| red | 30473 | 10.3% |
| grey | 24416 | 8.2% |
| green | 7343 | 2.5% |
| custom | 6700 | 2.3% |
| brown | 6593 | 2.2% |
| yellow | 2142 | 0.7% |
| Other values (2) | 2671 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 227866 | |
| l | 142025 | |
| i | 122255 | 8.6% |
| r | 114466 | 8.1% |
| b | 100677 | 7.1% |
| w | 88020 | 6.2% |
| t | 85985 | 6.0% |
| h | 79285 | 5.6% |
| c | 69561 | 4.9% |
| a | 64845 | 4.6% |
| Other values (11) | 326298 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1421283 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 227866 | |
| l | 142025 | |
| i | 122255 | 8.6% |
| r | 114466 | 8.1% |
| b | 100677 | 7.1% |
| w | 88020 | 6.2% |
| t | 85985 | 6.0% |
| h | 79285 | 5.6% |
| c | 69561 | 4.9% |
| a | 64845 | 4.6% |
| Other values (11) | 326298 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1421283 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 227866 | |
| l | 142025 | |
| i | 122255 | 8.6% |
| r | 114466 | 8.1% |
| b | 100677 | 7.1% |
| w | 88020 | 6.2% |
| t | 85985 | 6.0% |
| h | 79285 | 5.6% |
| c | 69561 | 4.9% |
| a | 64845 | 4.6% |
| Other values (11) | 326298 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1421283 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 227866 | |
| l | 142025 | |
| i | 122255 | 8.6% |
| r | 114466 | 8.1% |
| b | 100677 | 7.1% |
| w | 88020 | 6.2% |
| t | 85985 | 6.0% |
| h | 79285 | 5.6% |
| c | 69561 | 4.9% |
| a | 64845 | 4.6% |
| Other values (11) | 326298 |
state
Text
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | az |
|---|---|
| 2nd row | ar |
| 3rd row | fl |
| 4th row | ma |
| 5th row | nc |
| Value | Count | Frequency (%) |
| ca | 50614 | 11.9% |
| fl | 28511 | 6.7% |
| tx | 22945 | 5.4% |
| ny | 19386 | 4.5% |
| oh | 17696 | 4.1% |
| or | 17104 | 4.0% |
| mi | 16900 | 4.0% |
| nc | 15277 | 3.6% |
| wa | 13861 | 3.2% |
| pa | 13753 | 3.2% |
| Other values (41) | 210833 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 137111 | |
| c | 91464 | |
| n | 80937 | 9.5% |
| i | 67266 | 7.9% |
| o | 56973 | 6.7% |
| m | 56562 | 6.6% |
| t | 49156 | 5.8% |
| l | 47049 | 5.5% |
| f | 28511 | 3.3% |
| w | 26921 | 3.2% |
| Other values (14) | 211810 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 853760 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 137111 | |
| c | 91464 | |
| n | 80937 | 9.5% |
| i | 67266 | 7.9% |
| o | 56973 | 6.7% |
| m | 56562 | 6.6% |
| t | 49156 | 5.8% |
| l | 47049 | 5.5% |
| f | 28511 | 3.3% |
| w | 26921 | 3.2% |
| Other values (14) | 211810 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 853760 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 137111 | |
| c | 91464 | |
| n | 80937 | 9.5% |
| i | 67266 | 7.9% |
| o | 56973 | 6.7% |
| m | 56562 | 6.6% |
| t | 49156 | 5.8% |
| l | 47049 | 5.5% |
| f | 28511 | 3.3% |
| w | 26921 | 3.2% |
| Other values (14) | 211810 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 853760 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 137111 | |
| c | 91464 | |
| n | 80937 | 9.5% |
| i | 67266 | 7.9% |
| o | 56973 | 6.7% |
| m | 56562 | 6.6% |
| t | 49156 | 5.8% |
| l | 47049 | 5.5% |
| f | 28511 | 3.3% |
| w | 26921 | 3.2% |
| Other values (14) | 211810 |
Interactions
Correlations
| condition | cylinders | drive | fuel | id | manufacturer | odometer | paint_color | price | size | title_status | transmission | type | year | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| condition | 1.000 | 0.079 | 0.099 | 0.154 | 0.066 | 0.083 | 0.031 | 0.071 | 0.007 | 0.038 | 0.135 | 0.383 | 0.141 | 0.118 |
| cylinders | 0.079 | 1.000 | 0.386 | 0.198 | 0.024 | 0.338 | 0.021 | 0.074 | 0.000 | 0.315 | 0.040 | 0.156 | 0.244 | 0.081 |
| drive | 0.099 | 0.386 | 1.000 | 0.162 | 0.006 | 0.459 | 0.014 | 0.120 | 0.000 | 0.225 | 0.037 | 0.113 | 0.548 | 0.183 |
| fuel | 0.154 | 0.198 | 0.162 | 1.000 | 0.053 | 0.356 | 0.011 | 0.090 | 0.000 | 0.145 | 0.025 | 0.254 | 0.243 | 0.080 |
| id | 0.066 | 0.024 | 0.006 | 0.053 | 1.000 | 0.086 | 0.045 | 0.026 | -0.079 | 0.007 | 0.013 | 0.046 | 0.050 | -0.085 |
| manufacturer | 0.083 | 0.338 | 0.459 | 0.356 | 0.086 | 1.000 | 0.013 | 0.100 | 0.000 | 0.257 | 0.037 | 0.198 | 0.266 | 0.099 |
| odometer | 0.031 | 0.021 | 0.014 | 0.011 | 0.045 | 0.013 | 1.000 | 0.009 | -0.457 | 0.004 | 0.031 | 0.024 | 0.008 | -0.651 |
| paint_color | 0.071 | 0.074 | 0.120 | 0.090 | 0.026 | 0.100 | 0.009 | 1.000 | 0.000 | 0.080 | 0.023 | 0.134 | 0.094 | 0.085 |
| price | 0.007 | 0.000 | 0.000 | 0.000 | -0.079 | 0.000 | -0.457 | 0.000 | 1.000 | 0.000 | 0.000 | 0.007 | 0.000 | 0.491 |
| size | 0.038 | 0.315 | 0.225 | 0.145 | 0.007 | 0.257 | 0.004 | 0.080 | 0.000 | 1.000 | 0.021 | 0.133 | 0.333 | 0.045 |
| title_status | 0.135 | 0.040 | 0.037 | 0.025 | 0.013 | 0.037 | 0.031 | 0.023 | 0.000 | 0.021 | 1.000 | 0.061 | 0.031 | 0.081 |
| transmission | 0.383 | 0.156 | 0.113 | 0.254 | 0.046 | 0.198 | 0.024 | 0.134 | 0.007 | 0.133 | 0.061 | 1.000 | 0.284 | 0.256 |
| type | 0.141 | 0.244 | 0.548 | 0.243 | 0.050 | 0.266 | 0.008 | 0.094 | 0.000 | 0.333 | 0.031 | 0.284 | 1.000 | 0.093 |
| year | 0.118 | 0.081 | 0.183 | 0.080 | -0.085 | 0.099 | -0.651 | 0.085 | 0.491 | 0.045 | 0.081 | 0.256 | 0.093 | 1.000 |
Missing values
Sample
| id | region | price | year | manufacturer | model | condition | cylinders | fuel | odometer | title_status | transmission | VIN | drive | size | type | paint_color | state | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 7222695916 | prescott | 6000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | az |
| 1 | 7218891961 | fayetteville | 11900 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | ar |
| 2 | 7221797935 | florida keys | 21000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | fl |
| 3 | 7222270760 | worcester / central MA | 1500 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | ma |
| 4 | 7210384030 | greensboro | 4900 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | nc |
| 5 | 7222379453 | hudson valley | 1600 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | ny |
| 6 | 7221952215 | hudson valley | 1000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | ny |
| 7 | 7220195662 | hudson valley | 15995 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | ny |
| 8 | 7209064557 | medford-ashland | 5000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | or |
| 9 | 7219485069 | erie | 3000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | pa |
| id | region | price | year | manufacturer | model | condition | cylinders | fuel | odometer | title_status | transmission | VIN | drive | size | type | paint_color | state | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 426870 | 7301592119 | wyoming | 22990 | 2020.0 | hyundai | sonata se sedan 4d | good | NaN | gas | 3066.0 | clean | other | 5NPEG4JAXLH051710 | fwd | NaN | sedan | blue | wy |
| 426871 | 7301591639 | wyoming | 17990 | 2018.0 | kia | sportage lx sport utility 4d | good | NaN | gas | 34239.0 | clean | other | KNDPMCAC7J7417329 | NaN | NaN | SUV | NaN | wy |
| 426872 | 7301591201 | wyoming | 32590 | 2020.0 | mercedes-benz | c-class c 300 | good | NaN | gas | 19059.0 | clean | other | 55SWF8DB6LU325050 | rwd | NaN | sedan | white | wy |
| 426873 | 7301591202 | wyoming | 30990 | 2018.0 | mercedes-benz | glc 300 sport | good | NaN | gas | 15080.0 | clean | automatic | WDC0G4JB6JV019749 | rwd | NaN | other | white | wy |
| 426874 | 7301591199 | wyoming | 33590 | 2018.0 | lexus | gs 350 sedan 4d | good | 6 cylinders | gas | 30814.0 | clean | automatic | JTHBZ1BLXJA012999 | rwd | NaN | sedan | white | wy |
| 426875 | 7301591192 | wyoming | 23590 | 2019.0 | nissan | maxima s sedan 4d | good | 6 cylinders | gas | 32226.0 | clean | other | 1N4AA6AV6KC367801 | fwd | NaN | sedan | NaN | wy |
| 426876 | 7301591187 | wyoming | 30590 | 2020.0 | volvo | s60 t5 momentum sedan 4d | good | NaN | gas | 12029.0 | clean | other | 7JR102FKXLG042696 | fwd | NaN | sedan | red | wy |
| 426877 | 7301591147 | wyoming | 34990 | 2020.0 | cadillac | xt4 sport suv 4d | good | NaN | diesel | 4174.0 | clean | other | 1GYFZFR46LF088296 | NaN | NaN | hatchback | white | wy |
| 426878 | 7301591140 | wyoming | 28990 | 2018.0 | lexus | es 350 sedan 4d | good | 6 cylinders | gas | 30112.0 | clean | other | 58ABK1GG4JU103853 | fwd | NaN | sedan | silver | wy |
| 426879 | 7301591129 | wyoming | 30590 | 2019.0 | bmw | 4 series 430i gran coupe | good | NaN | gas | 22716.0 | clean | other | WBA4J1C58KBM14708 | rwd | NaN | coupe | NaN | wy |